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Abstract 



The purpose of this study was to validate the development and proposal of what the 
authors call STEMs (Standards Tests to Evaluate Mastery) and have defined them as 
calibrated classroom assessments that increase student motivation and provide authentic 
evaluation of student learning. Theoretical and empirical research on classroom 
assessment and grading, and high school Algebra failure rates were studied to provide the 
justification for the development and proposal of STEMs. The research indicates that 
there is a need for classroom assessments that increase student motivation, reduce grade 
inflation and deflation, and provide teachers with an authentic evaluation of student 
learning of the content. The research shows that student motivation to complete and 
expend energy on tasks is largely defined by the assessment environment created in the 
classroom. Students, educators, and parents would benefit from the development of 
calibrated classroom assessments that increase student motivation and success. (Contains 
1 table) 




STEMs (Standards Tests to Evaluate Mastery) are calibrated 
assessments that are designed to motivate students to take academic risks and 
make academic decisions about their learning while providing teachers with an 
authentic measure of the student’s level of understanding and achievement of 
academic standards. They are based on the concept of conation. “Conation is 
defined as the mental or behavioral tendency to act or strive to act. It involves 
both motivation... being disposed to act or wanting to act, and volition... taking 
steps or causing oneself to act. Both motivation and volition are involved in 
student effort in the classroom.” (Brookhart, 1997) 

The justification for the efficacy of STEMs will follow from an analysis of 
their structure and how that structure foments conation, and from empirical and 
anecdotal evidence of current practices. First, we will explain the physical 
structure of STEMs and provide a justification for that structure. Next, we will 
look at current research that substantiates their fundamental premise. Lastly, we 
will provide anecdotal evidence from years of observation of current practices 
and results from many classrooms and schools to demonstrate the need for 
STEMs. 

STEMs are 35-problem assessments where the questions are categorized 
and designed to fit into five levels; the five levels correspond to Bloom’s 
Taxonomy. There are ten level 1 , ten level 2, ten level 3, and five level 4 and 5 
questions. Level 1 questions correspond to Bloom’s “knowledge” level of 
learning, where students demonstrate their understanding of the standard at the 
most basic level. Level 2 questions correspond to the “comprehension” level of 
learning. Level 3 questions correspond to the “application” level of learning. 
Lastly, level 4 and 5 questions correspond to the “analysis”, and “synthesis” and 
“evaluation” levels of learning, respectively. Each STEM has the standard written 
at the top, and below the standard the different levels are explained so that 
students are reminded of what is being assessed and the level of learning they 
choose to attempt. Each set of problems is preceded by a heading indicating 
their level. 

The two elements of conation drive and serve to explain the structure of 
STEMs, thus the development of conation in the students begins with the 
structure itself. The motivational piece of conation is developed by the 
presentation of the questions. In STEMs, students are able to readily identify the 
level of questioning they choose to attempt and feel confident with. They will not 
have to guess at the level of questioning and sift through an entire test to find the 
problems they can and will answer. Gipps (1994a) found that student motivation 
is affected by the way instructional activities are presented in the classroom. 
Motivation follows from a feeling of success, and as students are free to choose 
the level they will be successful with on a STEM, the motivation will follow. As 
Brookhart (1997) pointed out, “judgments about what they can do will influence 
what they will do or how much effort they will expend.” 




Current classroom assessment practices will spread a varied number of 
levels throughout the test, forcing the students to attempt every problem to 
complete the test. For example, a traditional 35 question test may have 
“knowledge” level questions at the beginning, middle, and end, but some 
students may never get to the ones in the middle and end because they lose too 
much time on the higher order questions they encounter and may likely not 
answer correctly anyway. The end result is an incomplete exam and a student 
feeling like they don’t know anything. Schunk (1994) noted that students make 
decisions about their self-efficacy. The opposite is also true; why have a more 
advanced student attempt lower level problems if they can show mastery with the 
higher level questions? Current practices focus on quantity, and students are 
graded on how many they answer correctly, irrespective of the level of questions 
they answered. STEMs step away from assessing on quantity and focus on the 
quality of the questions. As a result, motivation shifts from a focus on answering 
“as many as I can” to “the ones I know I can.” 

The other conative element of STEMs, volition, follows from the fact that 
they do not deceive. On a STEM, a student will never have to guess at their 
grade or become disappointed with a grade they did not expect. There are fewer 
things more stressful and tense, to a student than anticipating a low grade on an 
exam; and fewer things more inhibiting than repeated low exam scores. 
“Information that a student can use to make himself or herself more competent is 
intrinsically motivating” (Ryan, Connell, & Deci, 1985). STEMs empower 
students to choose the grade they will receive on the exam depending on their 
level of understanding. When students receive the STEM, they evaluate the 
different levels of questions, and make a determination as to which level they can 
successfully respond to. For example, on one exam they may feel comfortable 
answering level 3 questions and on another level 4; they attempt only the level 
they choose. So, a 35 question test becomes a 10 question one with a student 
confident of success at that level and a teacher knowing exactly at what level of 
understanding the student lies. The result is students motivated, as opposed to 
resistant, to taking exams; the fear of failure has been removed. Nolen and 
Flaladyna (1 990) found that students' perceptions that their teachers valued 
independent thinking and mastery of material were positively related to task 
orientation and to beliefs in the value of strategies. With STEMs, there is no 
failure, just differing levels of understanding and performance. 

The current need for calibrated exams like STEMs arises from many years 
of low grades and poor performance in core subjects, particularly math, and 
specifically Algebra. There are several articles and studies that attest to Algebra 
failure rates around the country between 20 and 90 percent (see table one). 
Although it can be argued that the low grades and poor performance result from 
bad “teaching”, ultimately assessment is the element of teaching that teachers 
use to judge performance and assign marks (grades). “The grades teachers... 
assign to students’ work and performance have long been identified by those in 
the measurement community as prime examples of unreliable measurement.” 




(Guskey, 2006). So, it is possible to have good instruction (teaching) but low 
performance on badly written and designed assessments. STEMs do not 
interfere with a teacher’s style of instruction (how she teaches) but rather provide 
a genuine measure of a student’s understanding of the content. 

Grade inflation and deflation are also reduced with STEMs. Conventional 
grading practices involve a percentage of correct questions from the total number 
of questions, commonly known as point-system grading (Docan, 2006). So, on a 
conventional 35 question test, a student would have to score 31 .5 to achieve 
90% correct, 28 to achieve 80%, 24.5 to achieve 70%, 21 to achieve 60%, which 
normally correspond to A, B, C, and D grades respectively. However, 
conventional exams contain two inherent elements that compromise the 
accuracy with which they measure student understanding of the material. First, 
conventional teacher created exams tend to have a large number of lower level 
order of questions spread throughout the exam and a “current propensity to focus 
on low-level skills” (Shepard, 2000). So, while an A paper means that the 
student answered 90% of the questions correctly, that doesn’t mean that the 90% 
are at a higher cognitive level. This type of situation causes grade inflation: 
students with high grades but a minimal level of understanding. Second, the 
large number of problems on the exams places a greater emphasis on speed 
than showing mastery. A student taking his/her time may only reach question 25 
and answer all of them correctly, but the score will remain a C, not because the 
student has not demonstrated understanding, but because he/she did not 
complete all 35. This may cause grade deflation. 

An understanding of achievement goal theory (AGT) will help in 
appreciating how and why the inherent structure of STEMs eliminates the 
aforementioned problems. AGT proposes that the goal structure of an 
environment might affect students’ motivation, cognitive engagement, and 
achievement within that setting (Ames & Archer, 1988). Wolters (2004) suggests 
that there are essentially two goal structures that students perceive in 
classrooms, mastery and performance. Mastery goals primarily focus on 
engaging in achievement behavior through developing competence (Kaplan, 
Middleton, Urdan, and Midgley, 2002). With performance goals, success is 
demonstrated through extrinsic rewards, demonstrating ability, and doing better 
than other students (Midgely et al., 1998). Research examining secondary school 
students indicated that a mastery goal orientation was associated positively with 
students’ self reported effort and persistence at academic tasks (Miller, Greene, 
Montalvo, Ravindran, & Nichols, 1996). Additionally, Condry and associates 
(1977, 1978) demonstrated that learners’ curiosity, interest, and mastery of a 
subject remain more prevalent when rewards are not involved. 

With STEMs, because students are able to identify and choose the level of 
questions desired, and the questions are calibrated to Bloom’s Taxonomy, the 
teacher and student are assured that the student will receive a grade that reflects 
the level of understanding. So, students completing five level 4 and 5 problems 




correctly undeniably deserve an A as those problems are all at an evaluation, 
analysis, and/or synthesis level; this eliminates grade inflation. Also, as students 
are required to complete only ten problems per level on levels 1 through 3, and 
only five for levels 4 and 5, STEMs place due emphasis on student 
demonstration of mastery and not on the rate at which they answer. STEMs 
eliminate the doubts and discrepancies regarding student achievement levels. 

An intrinsic feeling of success and autonomy are large motivators in any 
endeavor and at any age. According to Wolters (2004), the types of tasks 
assigned, the grading procedures, and the degree of autonomy students are 
provided affect the achievement goals students adopt, and thus embody the 
classroom goal structure. Hence, compulsion cannot foster intrinsic feelings of 
success. When people are validated for what they know and do, the intrinsic 
feeling of success follows; success breeds motivation. Years of anecdotal and 
empirical data have demonstrated that motivation is one of the largest factors in 
the success of a student, and STEMs provide a way for teachers to instill that 
motivation by validating the different levels of understanding that they find in 
every class. 
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Table 1 



Failure 

rate 


Author 


Source 


44% 


Duke Helfand 


A Formula for Failure in L.A. Schools, L.A. Times, 
2006 


8.9% 
increase 
in failure 


Debra Viadero 


Alqebra-for-AII Policy Found to Raise Rates Of 


Failure in Chicaao, www.edweek.ora, Vol. 28, Issue 
24, Page 1 1 


84% 


Emily 

Richmond 


Students show math aains despite hiah failure rates, 


http ://www. lasvegassu n . co m/news/2009/feb/04/stude 
nts-show-math-gains-despite-high-failure-rate/ 


20% & 
33% 


Meirowsky, 

Fellers, 

Wertenberger 


Expandinq Hiqh School Math Curriculum, 
Proceedings of the 3rd Annual GRASP Symposium, 
Wichita State University, 2007 


20%-30% 


Lori Higgins 


Alqebra 1 stumpinq hiqh school freshmen, Detroit 
Free Press, 2008 


60% 


Sidener, 

Thomas, 

Turner 


Alqebra Alert, Superintendent’s Urban Principal 
Initiative (SUPI), 2006 


55% 


Rick DuFour 


Teachers Kev to Reversinq Hiqh Failure Rate in 


Math, htto://www.allthinasplc.info/wordoress/?p=87, 
2009 






